[Build] Adjust nvcc_threads for CI #27296

Merged
tianleiwu merged 1 commit into main from tlwu/ci_workflow_nvcc_threads on Feb 11, 2026

@tianleiwu
Contributor

We recently updated flash attention and now have more .cu files. Each .cu file needs a lot of CPU memory to compile.

Previously, we did not set nvcc_threads, and number_of_nvcc_threads() returned 3 for the A10 build machine. That number is too large because memory is limited (55 GB) while the number of parallel build jobs is large. For example, if the machine has 8 CPU cores, 8 * 3 gives a total of 24 nvcc threads, which can run out of memory.

This PR updates number_of_nvcc_threads() to use the new count of flash attention .cu files, and explicitly sets nvcc_threads in the CI build to avoid out-of-memory failures during the build.
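
For illustration only, here is a rough sketch of the arithmetic behind the problem. It is not the actual number_of_nvcc_threads() implementation. The helper names and the 4 GB-per-thread figure are assumptions made up for this example; only the 55 GB, 8 cores, and 3 threads come from the description above.

```python
def total_nvcc_threads(parallel_jobs: int, nvcc_threads: int) -> int:
    """Worst-case number of nvcc threads alive at once (hypothetical helper)."""
    return parallel_jobs * nvcc_threads


def max_nvcc_threads(memory_gb: float, parallel_jobs: int, gb_per_thread: float) -> int:
    """Largest per-job thread count that fits the memory budget, never below 1.

    gb_per_thread is an assumed peak memory cost of one nvcc thread; it is not
    a measured value from the build.
    """
    if parallel_jobs <= 0 or gb_per_thread <= 0:
        raise ValueError("parallel_jobs and gb_per_thread must be positive")
    fits = int(memory_gb // (parallel_jobs * gb_per_thread))
    return max(1, fits)


if __name__ == "__main__":
    # 8 parallel jobs, each running 3 nvcc threads, as in the description.
    print(total_nvcc_threads(8, 3))
    # 55 GB machine, 8 jobs, assumed 4 GB per thread.
    print(max_nvcc_threads(55, 8, 4.0))
```

Under the assumed 4 GB per thread, 24 threads would need about 96 GB, well over the 55 GB available, and the budget only allows 1 thread per job. This is why pinning nvcc_threads explicitly in CI is safer than relying on the computed default.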

@tianleiwu tianleiwu enabled auto-merge (squash) February 10, 2026 23:45
@tianleiwu tianleiwu merged commit 4295524 into main Feb 11, 2026
98 of 102 checks passed
@tianleiwu tianleiwu deleted the tlwu/ci_workflow_nvcc_threads branch February 11, 2026 08:38
2 participants